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Abstract 

We consider irregular product codes. In this class of codes, each codeword is represented by 
a matrix. The entries in each row (column) of the matrix should come from a component row 
^SJ (column) code. As opposed to (standard) product codes, we do not require that all component row 

codes nor all component column codes be the same. As we will see, relaxing this requirement can 
provide some additional attractive features including 1) allowing some regions of the codeword be 
more error-resilient 2) allowing a more refined spectrum of rates for finite-lengths and improved 
t-H performance in some of these rates 3) more interaction between row and column codes during 

I decoding. 

We study these codes over erasure channels. We find that for any < e < 1, for many rate 
distributions on component row codes, there is a matching rate distribution on component column 
HH codes such that an irregular product code based on MDS codes with those rate distributions on 

c/2 the component codes has asymptotic rate 1 — e and can decode on erasure channels (of alphabet 

i ^ i size equal the alphabet size of the component MDS codes) with erasure probability < e. 

> 1 Introduction 

\o 

^ Product codes were introduced in 1954 by Elias [6j. A product code can be viewed as a special case of 

Tanner construction [Hj in which smaller constituent codes make a larger code with low complexity 

\Q decoding. An mxn product code is defined by a row code C of length n and rate rc, and a column 

code C of length m and rate rc- Codewords are represented bymxn matrices which satisfy the 
constraint that every row belongs to C and every column to C . Product codes are decoded in an 
iterative fashion, where rows and columns are recovered in successive rounds using the decoders for 

• i— i C and C . The rate of the product code is the product of the rates rc and rc • 

In this work, we present irregular product codes, a generalization of product codes in which we 
do not require that the rows (columns) belong to a single code. We will show that while these codes 
still retain the advantages of product codes, they present some additional attractive features. 

One of the main advantages of product codes is the fact that decoding takes place over the 
smaller component codes, which can result in a speedup of decoding. Furthermore, by combining 
Reed-Solomon component codes, one can obtain product codes which have length equal to the 
square of the size of the component codes for the same field size, while taking advantage of the MDS 
properties of the small component codes. 

Another (more application-specific) feature of product codes is that they perform well on bursty 
channels. Indeed, for a product code which is transmitted row by row, a burst error will corrupt 
several consecutive rows but spread evenly over columns, thus allowing the column codes to recover 
the corrupted entries. 



1 



Irregular product codes are based on the simple idea that we need not restrict ourselves to a 
single row and column code, but instead allow row and column codes of multiple rates. The intuition 
behind this is that allowing for a few low-rate, highly error-resilient codes might boost the decoding 
process, while other high-rate codes ensure that the overall irregular product code has good rate. 
With a careful design of the rate distributions, one can hope to achieve better performance than 
for regular product codes. Irregularity has been a powerful concept in many contexts; e.g., irregular 
degree distributions for LDPC codes, LT codes, etc. This idea fully exploits the inherently interactive 
nature of the decoding of product codes. Indeed, round-based decoding of product codes lets some 
rows and columns "help" others to recover and go on with the decoding process. Allowing for 
various decoding capabilities for different rows and columns only taps further into this property of 
the decoder. Q 

Irregular product codes retain the advantages of product codes, while presenting additional fea- 
tures that make them more attractive. Decoding still takes place over smaller codes and the field size 
is still allowed to grow slower in the case of MDS component codes. Further, not only do irregular 
product codes still perform well on bursty channels, they can also be more powerful than regular 
product codes when some parts of the codeword are known to be more vulnerable to bursts than 
others, since the row and column codes error-correction capabilities are tunable. 

Moreover, for short-length linear codes, there do not exist product codes of every desirable di- 
mension, since fixing the dimension of the product code leaves few choices for the dimensions of the 
component codes. Irregular product codes, on the other hand, allow for many more dimensions due 
to the numerous choices for the rate distribution of the component codes. 

In this work, we first derive bounds on the rate and minimum distance of irregular product codes, 
and give constructions that achieve these bounds. We then give explicit families of irregular product 
codes that can get rates arbitrarily close to 1 — e on channels with erasure e based on MDS component 
codes. Note however that this does not mean that these codes are capacity-approaching in the sense 
of Shannon capacity because the field size for MDS codes can grow as a function of the lengthj^] 

We give simulation results for finite-length codes that show that irregular product codes have 
better thresholds than product codes of the same dimension or close dimension for some specific 
lengths. 

1.1 Related works 

Since the introduction of the product codes [6] many extensions have been proposed and these codes 
have found many applications from magnetic recording [1] to deep space communication [1J mainly 
because of their simple construction and low complexity decoding. 

The use of different component codes for rows and different component codes for columns is not 
new. In fact, [13] and [3J consider product codes for image transmission where the rows are LDPC 
codes and the columns are RS codes with different rates. They determine the optimum rate of the 

1 Indeed, in product codes that achieve rates close to Shannon limit (say on erasure channels), either the row/column 
code (say row code) should have rate close to 1. In this case, the decoding happens first in the column codes whose 
rate is far from 1, and then the row codes play a "complementary" role. As we will see, there exist irregular product 
codes with rate vs. decoding capacity matching these product codes in which the row codes and column codes have the 
same distribution of rates, and the decoding process involves a longer and gradual interaction between row and column 
component codes. 

2 On the other hand, one can show that our analysis can be extended to the situation where instead of MDS codes 
as component codes, we use capacity-approaching codes of the same rate but over a fixed erasure channel, say BEC. 
In this case, the resulting product code will be truly capacity-approaching. 
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RS codes by a dynamic programming. 

However, to the best of our knowledge, irregular product codes with the generality considered 
in this paper together with some of their asymptotic behavior have not been previously similarly 
explored. 

Multidimensional product codes are investigated in [lOj and [TT]. However, the component codes 
are restricted to be single parity and extended Hamming codes. In these papers, the authors devise 
a low complexity soft decoding algorithm for AWGN channels. 

The weight distribution of some instances of product codes is known. For example, [5] analyzes 
the error floor region of an extended Hamming product code by means of the weight enumerator of 
the code and the union bound. Some characterization of the stopping sets over the erasure channel 
is obtained in [12] based on the minimum distance of the component codes. [2] tries to optimize the 
design of a product code where the component codes are limited to single parity codes and certain 
extended Hamming and BCH codes. 

Product codes can be decoded iteratively using a message passing algorithm in noisy channels. 
Because of this, they are also referred to as turbo block codes [9] in the literature of coding theory. 

The Tanner graph of the product code is regular. [8j considers product codes as structured 
generalized LDPC codes. 

For a thorough survey on product codes refer to [7]. 

1.2 Organization of the Paper 

The remainder of the paper is organized as follows. In Section[2j we define irregular product codes. In 
Section[3j we derive an upper bound on their dimension, and prove that under certain conditions, this 
upper bound can be achieved. In Section |4j we also derive a lower bound on the minimum distance of 
irregular product codes and show that sometimes this lower bound is achieved. In Section [5j we turn 
to the asymptotic analysis of irregular product codes on erasure channels under the iterative decoding 
which switches back and forth between rows and columns. In Section [6j we give explicit families of 
irregular product codes based on MDS component codes that achieve rates close to what capacity- 
achieving codes achieve. Finally, in Section [JJ we give some irregular product code constructions for 
specific code lengths and show by simulation that these constructions outperform regular product 
code of the same (or approximately the same) dimension. 

2 Definition 

We denote the set {1, . . . , m} by [to]. 

Definition 1. Let ¥ be a field and let m,n be positive integers. For each i S [to] let C% be a code of 
length n over ¥ and for each j G [n] let Cj be a code of length m over ¥. 

The m x n irregular product code C = C({Ci}i, {Cj}j) is the code of length mn over ¥ such that 

C = {(Cij)ie[m],j£[n] |Vi (cjl, . . . , Cj n ) 6 C%\ Vj, (c±j, . . . , C m j) £ Cj}. 

In the above definition, when all the codes Cj corresponding to the rows are equal and all the 
codes Cj corresponding to the columns are equal, we obtain a standard product code. 
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3 Rate of Irregular Product Codes 



Theorem 2. Consider an m x n irregular product code C = C({Ci}i,{Cj}j). Let < a\ < ... < 
cL m < ti and < b\ < . . . < b n < m be two integer sequences. For i G [m], assume that the value of 
the first ai coordinates of any codeword in Ci can generate the remaining coordinates (in the sense 
that the values of these remaining coordinates are a function of the values of the first a« coordinates). 
Similarly, for each j G [n] , assume that the first bj coordinates of any codeword in C'j can generate 
the remaining coordinates. Then 



1. C has dimension at most 



j=l i=6,-_i+l 



where we define 6q := 0. 



2. If furthermore for all i G [m] , j G [n] , Ci is a linear code of dimension ai and C'a is a linear 
code of dimension bj, and C\ C • • • C C m and C[ C • • • C C' n , then C has dimension exactly kc 
as given by 

Proof. The coordinates of a codeword in C are all pairs G [m] x [n]. In the following, we will 
describe a procedure that returns some subset of these coordinates as "generating coordinates" . As 
we shall see and as their name suggests, one can generate the remaining coordinates of a codeword 
in C from these coordinates. The number of these generating coordinates will be an upper bound 
on the dimension of C. This is only an upper bound because there might be some settings of these 
generating coordinates that do not give rise to valid codewords. 

In this procedure, initially all coordinates are unmarked. Each coordinate will eventually be 
marked either as "generating" or as "determined". A row (column) where not all coordinates have 
been marked is called "available" . A row (column) whose marked coordinates can generate the values 
of the remaining unmarked coordinates is called "determined" . 

While there exists an unmarked coordinate 

(A) if there exists an available determined row 

• pick the available determined row with the smallest index 

• mark its unmarked coordinates as "determined" 

(B) else if there exists an available determined column 

• pick the available determined column with the smallest index 

• mark its unmarked coordinates as "determined" 

(C) else 

• pick the available row with the smallest index 

• starting from the smallest unmarked index, mark as many coordinates as is necessary as 
"generating" until the first Oj coordinates are marked 

• mark the remaining coordinates as "determined" . 
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Claim 1. In the above procedure, the number of coordinates finally marked as generating are 



Proof (of Claim\ly. For each row i, we count the number of generating coordinates in row i: 

Assume i > b n . During the procedure, rows with smaller index become determined earlier and 
hence become fully marked earlier also. Thus, before the procedure executes on row i, all rows 
— 1 have been fully marked, hence all the columns are determined. Hence, the columns 
one-by-one cause the procedure to go through (B), until row i becomes determined, at which point 
the procedure goes through (A) on row i. Hence, no coordinate in row i is ever going to be marked 
as generating. 

Now assume i < b n and consider the greatest j such that bj-i < i. Before the procedure 
executes on row i, all rows 1, . . . , i — 1 have become fully marked, hence all columns 1, . . . , j — 1 are 
determined. If a« < j, all these columns cause the procedure to go through (B) one-by-one until 
row i becomes determined, at which point the procedure goes through (A) on row i. Hence, in this 
case, no coordinate in row i is ever going to be marked as generating. If, on the other hand, at > j 
then consider coordinate Since the coordinates marked in a column are always a prefix of the 

column and since column j requires bj > i marked coordinates to become determined, the coordinate 
cannot be marked through (B) on column j (instead of through row i). By a similar argument, 
all coordinates + 1), . . . , (i, n) are going to be marked through row i (rather than through their 
columns). This implies that when the procedure executes on row i, coordinates (i,j), • • • , (i,ai) are 
not yet marked, and so the procedure goes through (C) on i, and exactly these coordinates are 
marked as generating. 

In other words, the number of generating symbols in row % is max(aj — j + 1, 0). This completes 
the proof of Claim [Tj □ 

From the way the kc generating coordinates were chosen, it is clear that the value of a codeword 
of C is a function of its value at these kc coordinates. This finishes the proof of part 1 of Theorem [2j 
To prove part 2 of Theorem [2j we show that under the conditions of part 2, the above procedure 
naturally gives rise to a systematic encoding algorithm for code C: When it marks a coordinate as 
generating, it can place an information symbol in this coordinate; when it marks a coordinate as 
determined while executing on a row (column), the value at this coordinate is generated from the 
generating coordinates of this row (column) according to the corresponding row (column) code. We 
only need to show that any setting of the kc generating coordinates gives rise to a valid codeword of 



This algorithm begins with an empty m x n matrix (cjj) corresponding to a codeword and fills 
its entries until all entries are filled and we have a matrix (cy) G F mxn . To show that the final 
matrix (c^) is a valid codeword, we prove by induction on the number of steps of the algorithm that 
(cij) never violates any row code or column code. By that we mean that for every row i (column j), 
at any point during the algorithm the filled entries in row i (column j) are a projection of a valid 
codeword in Cj (C'-) on these entries; in other words, these filled entries do not satisfy any linear 
constraint that is not satisfied by Cj {C'A. 

No entry Cy will ever violate its row code. A proof of this claim goes as follows: Since determined 
rows are given precedence over determined columns, Cij is never filled through column j if j > a%. 
Indeed, if j > a^, row i must have been already determined at the point where c^ is filled. It means 



rt 



b 





3=1 i=6 J -_ 1 +l 



C. 
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that when row i is picked by the algorithm, at most its first Oj entries are filled. Since Cj is generated 
by its first dj coordinates and has dimension dj, any setting of these coordinates will correspond to 
the projection of some valid codeword on its first a« coordinates. 

Thus, the only case we need to consider is that of an entry c^ violating its column code Cj (when 
row i is being filled). We claim that this can also never happen. Let Cij be the first entry that 
violates its column code so that for all %' < i and all j' < j, the entries Cj/j/, c^-, and Cij> do not 
violate their respective column codes. As Cij violates Cj, we must have that bj < i. Since Cj has 
dimension bj and is generated by its first bj coordinates, there exists (/3i, . . . G F* _1 such that 

for any valid codeword {y\, . . . ,y n ) of Cj, we have yi = but 

% ^(/3 ! c 1 ... i _ lii ). (2) 

Since C| C • • • C Cj, this implies that for each of the first j — 1 columns, its first i coordinates 
correspond to the projection of a valid codeword of Cj on its first i coordinates. Thus, for each 
j' < j, we have that 

Cij' = (/3,ci...i_ij/). (3) 

On the other hand, since C\ C • • • C Cj and j < aj, a similar argument shows that there exists a 
vector a G F J_1 such that for all i! < i, 

Ci'j = (<x,Ci>,l-j-i)- (4) 

Using d3| and Q, we see that = J2i<i'<i l<j'<j a i'Pj' c i'j'- But using (|2| and (j3j), we see that 
c ij XJi<i'<i i<j'<j a i'Pj' c i'j'- This contradiction shows that no violates a column code. □ 



4 Minimum Distance of Irregular Product Codes 

The following theorem gives the best general lower bound on the minimum distance of an irregular 
product code in terms of the minimum distances of the individual row and column codes. Notice 
that this does not preclude the possibility of obtaining better lower bounds if we know more about 
the row and column codes. 

Theorem 3. For two integer sequences n > d± > . . . > d m > 1 and m > d\ > . . . > d' n > 1, define 

i' i' 

D= min max — ii' — i + l)(f — j + 1) + > d^ + > d' k . 

IKiKm-dj+l-^KjKn-di+l i-l<i' <m;j-l<j' <n ^ ' 

k=i k=j 

The number D is the minimum weight of a binary nonzero m x n matrix where every nonzero row 
i has weight > di and every nonzero column j has weight > d'j. Therefore, if C = C({C}j, {Cj}j) 
is an m x n product code such that mindist(Ci) = di and mindist{C'j) = d'j, then mindist{C) > D. 
On the other hand, for any two sequences n > d\ > . . . > d m > 1 and m > d[ > . . . > d' n > 1, 
there exist row codes Ci and column codes Cj with mindist(Ci) = di and mindist{C'j) = d'j, such that 
mindist(C) = D. 

Proof. Consider a minimum weight binary nonzero m x n matrix M where every nonzero row i has 
weight > di and every nonzero column j has weight > d'j. Because the sequence of diS and the 
sequence of djs are sorted nonincreasingly, we can permute the rows and columns of M in such a 
way that the nonzero rows become rows i, i + 1, . . . , m for some 1 < i < m and the nonzero columns 
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become columns j,j + l,...,n for some 1 < j 1 < n while still preserving the property that for each 
(nonzero) row i" € [i,m] (column j" € [j, n]) has weight > dj» (> dj"). Now, consider the submatrix 
of M consisting of the intersection of rows i, . . . ,m and columns j, . . . ,n. We want to minimize the 
number of ones in this submatrix. Instead we look at the problem of maximizing the number of zeros 
in this submatrix, which can be expressed as a max-flow problem where for each i" € [i,m] there is 
an edge of capacity n — j + 1 — dj» from the source to a vertex that corresponds to row i", for each 
j" £ [j, n] there is an edge of capacity m — i + 1 — d'-„ from a vertex that corresponds to row j" to 
the sink, and there is an edge of capacity 1 from each row i" to each row j". Using the fact that 
the min-cut equals max-flow, one can show that the minimum number of ones in this submatrix is 
indeed 

i' i' 

;4 + ^4-(i'-i + i)(/-j + i). 



max \ i 

i—l<i'<m, j—l<j'<n 

k- 



Since row i has at least di ones, we have d{ < n — j + 1. Similarly, dj < m — i + 1. Minimizing over 
all i and j, we get that D is the weight of M. 

Now, we can deduce that for any two distinct codewords in the product code C, since they differ 
on at least di (dj) coordinates in every row i (column j) in which they differ in at least one coordinate, 
they have Hamming distance > D. 

Finally, assume the sequences di,...,d m and d'i_,...,d' n are given. Find a weight-D matrix 
M e {0, l} mxn where each nonzero row i has weight > di and each nonzero column j has weight 
> d'j. For each zero row i, we define Cj to be any linear code of minimum distance d{. Similarly, for 
each nonzero row i, we want to find a code Cj of minimum distance di such that row i of matrix M is 
a codeword in Cj. An [n, k = n — di + 1, <ij]-Reed-Solomon code has at least one codeword of weight 
w for each w 6 [di, n] (because the degree-(/c — 1) polynomial (x — a\) k+w ~ n (x — 02) . . . (x — a n - w ) 
has exactly w non-roots among distinct elements a.\, . . . , ot n of a field.) Thus, we can multiply each 
codeword coordinate of such a Reed-Solomon code by an appropriate nonzero field element in such a 
way that row i of the zero-one matrix M is a codeword in the resulting code Cj of minimum distance 
di. Similarly, For each zero column j, we define C'j to be any linear code of minimum distance d'j. 
Similarly, for nonzero columns j, we can find column codes Cj of appropriate minimum distance d'j 
such that row j of matrix M is a codeword in C'j. Finally, we need to choose the same symbol field 
for all these codes C% and Cj. We can choose the field to be ¥ q for some q > max(m, n). Then, the 
minimum distance of C = C({Ci}i, {C'j}j) is D. 

□ 



5 Asymptotic Analysis of Decoding Irregular Product Codes on 
Erasure Channels 

We need the following definition for the next theorem. 

Definition 4. Consider an m x n irregular product code C = C({Ci}i, {C'j]j). We are interested in 
the asymptotic behavior of C, therefore we think of C not individually but as one member of a family 
of irregular product codes where m and n grow. Suppose that a, (3 : [0, 1] — > [0, 1] are non- decreasing 
real functions. We say that the row and column codes have asymptotic normalized minimum distance 
distribution a and j3 if for every 81,82 > 0, for large enough m and n, for each i £ [m],j £ [n] we have 
\mindist(Ci)/n — a(x)\ < 8\ for some x such that\l — i/m — x\ < 82 and \mindist(C'j)/m — f3(y)\ < 8\ 
for some y such that |1 — j/n — y\ < 82- 
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Theorem 5. Assume an m x n product code C = C({Cj}j, {Cj}j) having asymptotic normalized 
minimum distance distribution a and (3 as in Definition [^} Assume that neither of m or n grows 
exponentially or faster in terms of the other one. Consider that a codeword in C is sent over an 
erasure channel where each symbol is erased with probability e > 0. We iteratively decode row codes 
and column codes of C whenever the number of erasures in a row or column is smaller than the 
minimum distance of the code corresponding to that row or column. Assume that 

a- l {ep- 1 {ex)) < x for all x E (0, 1], (5) 

where we define = sup(5 x ) for S x = {z E [0, 1] : j3(z) < x] if S x / and we define /3 _1 (x) = 

if S x = 0- We define a" 1 similarly. Then for any constant 5q > 0, for large enough codes in the 
family, all except a b^-fraction of the symbols can be decoded except with a probability exponentially 
small in min(m, ra). 

Proof. Let y = /3~ 1 (e). Notice that y only depends on j3 and does not depend on m and n. Let 
1 > y' > y, where we assume that y' also is a number independent of m and n. We claim that for 
large enough m and n, with very high probability all except the last y'-fraction of the columns can 
be decoded in the first step. To see this, let y" E (y, y'). We know (3(y") > e, hence for some e' > e, 
for large enough codes in the family, we have mindist(Cj) > e'm when n — j < y'n, i.e., for the last 
y'-fraction of the columns. The probability that each of the length-m codes corresponding to these 
columns cannot be decoded is exponentially small in m by the Chernoff bound, since these codes can 
decode up to e'm — 1 erasures while we have on average em erasures. Since the number of columns 
does not grow exponentially in m, we can use a union bound to derive our claim. 

Next we define x\ = a~ 1 (ey). We claim that for any 1 > x' > x\ with very high probability, all 
except the last x' fraction of rows can be decoded. To see this, let x" £ (xi,x'). We know a{x") > ey, 
hence a(x") > ey" for some y" > y. We can conclude that mindist(Cj) > ey"n when m — i < x'm, 
i.e., for the last x'-fraction of the columns. On the other hand, if we choose y' £ (y,y"), by the 
previous paragraph with high probability all the symbols not appearing in the last y'-fraction of the 
columns are decoded for large enough codes. Therefore, the average number of undecoded symbols 
at each row is at most ey'n. Again, we can derive our claim by a union bound on Chernoff bounds. 

Repeating the above argument back and forth between rows and columns, we get a non-increasing 
sequence a^o = l,xi,X2, . . . where Xi + \ = a _1 (e/3 _1 (exj)). Here 1 — Xi denotes the approximate 
fraction of rows that are guaranteed to be decoded after i back-and-forth rounds of decoding. If this 
sequence converges to 0, then 5o > X{ for some i. That would mean that with high probability, at 
most a <5o-fr ac ti° n of the rows and hence at most a <5o-fraction of all the symbols are not decoded by 
the end of the algorithm. 

So we just need to check that the monotonic sequence xq,x\,X2, ■ ■ ■ converges to if condition 
([5]) is satisfied. Assume otherwise that x* = lim^oo X{ > 0. We have 

a~ 1 (e/9~ 1 (ex*)) = cr 1 ^/ 3 " 1 ^ lim a*)) = Hm aT 1 {e$~ x {ex$) = x* 

i— >oo i— >co 

because a" 1 and can be shown to be right-continuous. This contradicts condition ([5]). □ 

6 Irregular Product Codes from MDS Codes 

Proposition 6. Consider an irregular product code C = C({Cj}j, {Cj}j) where Ci is an [n, ai]-MDS 
code and Cj is an [m,bj]-MDS code for all i,j. If a\, . . . , a m and bi,...,b n are non-decreasing 
sequences, then the dimension of C is upper-bounded by formula |7p. 
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Figure 1: This figure shows how the curve for a is obtained from the curve for f3 in Theorem |7| We 
stretch the curve for (5 vertically by a factor of 1/e, and then shrink the curve for a horizontally by a 
factor of e. That is, whenever x = f3(y)/e, we have a(x) = ey. The area of the shaded region denotes 
the asymptotic rate of the code, which is 1 — e. 



Furthermore, given any two integer sequences < a± < . . . < a m < n and < b\ < . . . < b n < m, 
we can meet this upper-bound in the following way. Choose n distinct elements ati, ■ . ■ ,a n and 
m distinct elements fix, ... ,/3 m of the symbol field F. Let V be the a m x n Vandermonde matrix 
Vij = a* -1 and V be the b n x m Vandermonde matrix — /3j l . Let Ci be the Reed-Solomon code 
having as generator matrix the first a% rows of the matrix V and Cj be the Reed-Solomon code having 
as generator matrix the first bj rows ofV. Then the dimension of C is given exactly by formula |lp. 

Proof. Any a% coordinates of Cj (and in particular the first a« coordinates) can generate the rest 
of the coordinates. Similarly, the first bj coordinates of C'- generate the rest of the coordinates. 
Therefore, by Theorem [2j the rate of the irregular product code is upper-bounded by formula ([!]). 

Now assume Cj has as generator matrix the first rows of the Vandermonde matrix V. Then for 
i < i! we have a» < a^, hence Cj has a generator matrix which is a submatrix of a generator matrix 
of Cj/, and hence Cj is a subcode of Cj/. We can argue similarly about the column codes. This shows 
that by Theorem [2j the exact rate is given by formula Q. □ 

Theorem 7. For each e > 0, the following is a generic way of constructing families of irregular 
product codes with asymptotic rate 1 — e such that for any constant 5 > one can decode almost all 
the symbols of a codeword sent over an erasure channel having erasure probability e — 5: 

Choose any non- decreasing function j3 : [0,1] — > [0,1] with /3(1) < e and lim^o P{v) = 0. 
Define a : [0,1] — > [0,1] by a{x) = e(3~ 1 (ex) where /3 _1 is defined in Theorem^ Choose m and 
n as you wish but neither of m or n should grow exponentially or faster in the other one. Then 
choose < ai < ... < a m < n and < b\ < . . . < b n < m as you wish but in such a way that 
aj = n(l — a(l — i/m + o(l))+o(l)) and bj = m(l — (3(1 — j/n + o(l)) + o(l)). Then choose the row 
codes Ci to be nested linear MDS codes of dimension aj (for example as in Proposition^. Choose 
similarly column codes Cj of dimension bj. 

Proof. For MDS codes Cj, we have mindist(Cj) = n — ai + 1. Therefore the minimum distance of 
the row codes can be approximated correctly using a as in the statement of Theorem [5] We can do 
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similarly for the column codes. In order to show we can decode from e — 5 > fraction of errors, it 
is enough by Theorem [5] to check that 

a- 1 ((e-<5)/3- 1 ((e-5)x)) > x 

does not happen for any x S (0, 1]. If it does for some x, then for any < x' < x, we have 

ef3- 1 (ex') = a(x') < (e - <J)/3 _1 ((e - 5)x). 

Since /3 _1 (ex') > /3 _1 ((e — 6)x) for all x' G [(e — 5)x/e,x), this implies (3 (ex') = for all such x'. 
This implies f3~ 1 (ex') = for all x' £ (0,x) and this contradicts lim^o /3(y) = 0. 

Now we want to calculate the asymptotic rate. By Theorem [6j the dimension of the code is given 
by formula ([!]), which can be expressed as 

ai - max({j : bj < i}) 
mn Ej [max( , 0)J . 

Thus, the rate is asymptotically equal to 

l r i 

max(/3 -1 (x) -a(x),0) dx = / - e^ 1 (ex)) dx 



x=0 Jx=0 



rHx) dx-e /3 _1 (ex) dx 

x=0 Jx=0 

[(l-e)+ / f3-\x)dx]-[ p~ 1 {x)dx 
1 - e. 



□ 

We note that the only regular products codes based on MDS codes which have decoding properties 
asymptotically as good as those constructed in Theorem [7] are codes where either a = or (3 = 0. 
These are regular product codes in which the row codes or column codes have rate 1 — o(l). 



7 Examples of Finite-Length Irregular Product Codes 

In order to find an example of an irregular product code for finite but not so small lengths, say 50 x 50, 
we used the asymptotic irregular product code shown in Figure [2a| obtained from Theorem[7j in which 
a(x) = ex and f3(y) = ey where e is the erasure probability. The area of the shaded region, which 
represents the systematic part of the code, is the rate 1 — e of the code. 

Next we slightly tuned the asymptotic code to a 50 x 50 irregular product code in such a way 
that 

• the code can start decoding better, by increasing the number of row and column codes having 
the highest minimum distance by a few; 

• more importantly, the code has a much higher probability of decoding all symbols once most of 
the symbols have been decoded, by forcing that the minimum distances of all row and column 
codes are at least some positive number, in this case 3. 
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(a) Asymptotic 



(b) Finite length 



Figure 2: a) The shaded region corresponds to the systematic or information part of the code by 
choosing a(x) = ex and f3(y) = ey where e is the erasure probability, b) Systematic part of a 
[2500, 1709] irregular product 




0-27 0.26 0.25 0.24 0.23 0.22 



Figure 3: Comparing an irregular [2500, 1709] code to almost equal rate regular ones. The number 
on the legends indicate the corresponding row and column dimensions. 

We chose all row codes and all column codes to be nested MDS codes according to Theorem [6} The 
resulting code has a systematic part which is shown in Figure |2b} 

This code is a [2500, 1709] code of rate 0.6836. We compared this code to all regular product 
codes having rates [0.6708,0.684]. Note that most of these codes have rate even lower than this 
code. The result of the simulation is shown in Figure [5J This plot shows the block/word error rate 
of the code in an erasure channel with erase probability e. All constituent row and column codes 
in irregular and regular cases are considered to be MDS with the corresponding dimensions. Each 
point of these curves is obtained by 10 6 simulations. The erasure patterns for different values of e 
have been coupled such that the block error versus erasure probability curve is monotonic. One can 
see that this code outperforms all product codes having lower rates. 

Figure [4] shows another case where an irregular code outperforms a regular code for a much 
smaller length. We compared a regular [8x8,4x7] product code with an irregular code which is 
shown in Figure |4b} Numbers on rows and columns indicate the dimension of the corresponding row 
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(a) Error curve 



(b) Systematic region 



Figure 4: Comparing an 8 x 8 regular and irregular product codes both with dimension 28 



and column MDS code. The block error probability of these codes is shown in Figure 4a Both codes 
are [64, 28] codes. 
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